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ON THE CONTRACTION METHOD WITH DEGENERATE LIMIT 

EQUATION 

By Ralph Neininger 1 and Ludger Ruschendorf 
J. W. Goethe University and Universitat Freiburg 



A class of random recursive sequences (Y„) with slowly varying 
variances as arising for parameters of random trees or recursive al- 
gorithms leads after normalizations to degenerate limit equations of 
the form X = X. For nondegenerate limit equations the contraction 
method is a main tool to establish convergence of the scaled sequence 
to the "unique" solution of the limit equation. In this paper we de- 
velop an extension of the contraction method which allows us to de- 
rive limit theorems for parameters of algorithms and data structures 
with degenerate limit equation. In particular, we establish some new 
tools and a general convergence scheme, which transfers information 
on mean and variance into a central limit law (with normal limit). 
We also obtain a convergence rate result. For the proof we use selfde- 
composability properties of the limit normal distribution which allow 
us to mimic the recursive sequence by an accompanying sequence in 
normal variables. 



1. Introduction and degenerate limit equations. A large number of pa- 
rameters of recursive combinatorial structures, random trees and recursive 
algorithms satisfy recurrences of the divide-and-conquer type 

K 



where Ir are random subgroup sizes in {0, ...,n}, b n is a toll function 

and (yj r) ) n >o, r = 1, . . . ,K, are independent copies of the parameter, cor- 

responding to the contribution of subgroup r, = denotes equality in dis- 
tribution. Typical parameters Y n range from the depths and path lengths 
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of trees, the number of various substructures in combinatorial structures, 
the number of comparisons, space requirements and other cost measures of 
algorithms to parameters of communication models, and many more. 

The contraction method is an efficient and quite universal probabilistic 
tool for the asymptotic analysis of recurrences as in (1). It has been in- 
troduced for the analysis of the Quicksort algorithm in Rosier (1991) and 
further developed independently in Rosier (1992, 2001) and Rachev and 
Rtischendorf (1995), see also the survey article of Rosier and Riischendorf 
(2001). It has been applied since then successfully to a large number of 
problems. 

Recently, a fairly general unifying limit theorem for this type of recurrence 
has been obtained by the contraction method in Neininger and Riischendorf 
(2004) in the nondegenerate case, where the limit distribution of the nor- 
malized recurrence is uniquely characterized by a fixed point equation; we 
give an illustrative example below. By this result one, in general, obtains the 
limit distribution from the limiting recurrence and asymptotics of moments. 

The aim of this paper is to extend the contraction method and to state 
a general limit theorem for the degenerate case. In the degenerate case the 
characterizing equations for the normalized algorithm degenerate in the limit 
to the trivial equation X = X and, thus, give no indication on the limit 
distribution. This case is also quite common in many examples. To simplify 
the discussion we consider in the first part of the paper recursive sequences 
(Yn)n>o which satisfy the distributional recurrence in (1) in the most basic 
setting, where K = 1, that is, we assume that 



where no > 1, (I n ,b n ), are independent, b n is random and I n is a random 
index in {0, . . . , n} with P(I n = n) < 1 for n > uq. Later on in Section 5 we 
come back to the more general case as in (1). 

To derive a limit in distribution for (Y n ) as in (2) by the contraction 
method the first step is to introduce a scaling of Y n , say X n := (Y n — fi n )/a n , 
where a n = EY„ and a n = ^/Var(Y n ) and to derive a recurrence relation for 



(2) 



Y n = Y In + h 



n>riQ 



(3) 




n > no 



where 



b {n) := — (b n -Hn + Hi n ) 



and with independence relations as in (2). 
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The next step to prove a limit theorem for X n is to establish convergence 
of the random coefficients in the recursive equation (3): 

(4) — — > A, feW b, 

On 

thus, leading to a limit equation of the form 

(5) X = AX + b. 

Here, (A, b) and X are independent. Essential for the application of the 
contraction method is that the limit equation (5) has a unique solution 
under appropriate constraints. The final step of the method is to establish 
convergence of the X n to the solution of the limit equation (5). 

Many examples of such an approach in the field of analysis of recursive al- 
gorithms can be found in Cramer and Riischendorf (1996), Neininger and Riischendorf 
(2004), Rosier (1991, 2001) and Rosier and Riischendorf (2001) and the ref- 
erences therein. 

As a typical example of this approach, we consider the Quickselect algo- 
rithm which is designed similarly to the Quicksort algorithm and, as a result, 
yields a fixed order statistic x^k) °f an ?>tuple of real numbers x\, . . . ,x n . If 
Y n denotes the number of comparisons this algorithm needs to find im , then, 
under the assumption that all permutations of (xi) are equally likely, Y n sat- 
isfies (2), where J n ~ unif{0, . . . , n — 1}, b n = n — 1, no = 2, and Yq = Y± = 0. 
It is known for this recursion that expectation and variance are of the orders 
KY n = 2n + O(l) and Var(y n ) = n 2 /2 + o(n 2 ), so that, noting that I n /n has 
a continuous unif [0, 1] distributed random variable U as its limit, we obtain, 
after scaling and deriving the limits in (4) , a limit equation (5) with A = U 
and b = y/2{2U - 1), thus, 

(6) X = UX + V2(2U-1). 

The solution of this equation, rescaled by W = y/Tj2X + 1, satisfies the 
equation 

(7) W = UW + U, 

whose unique solution is the Dickman distribution, which is quite common 
in the analysis of algorithms, as well as in analytic number theory where it 
originated [see Hwang and Tsai (2002)]. Standard application of the contrac- 
tion method implies that the fixed point equation (6) has a unique solution 
C(X) and that the rescaled quantity (Y n — EY n ) / y / Vav(Y n ) converges in 
distribution to this fixed point. 

In this paper we discuss which appears quite often for parameters 

X n with logarithmic orders for the variance; see the examples below. Here, 
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in the limiting equation (5) we are led to the case A = l, 6 = 0, that is, to 
the degenerate limit equation 

X = X. 

The degenerate limit equation does not give any hint to a limit of the re- 
cursive sequence (X n ) and so the contraction method does not work in this 
case. 

We will focus in this paper on recursions of the form (2) and the extensions 
in (1) which lead to a degenerate limit equation and exhibit an asymptot- 
ically normal behavior for the scaled quantities X n . We will explain how 
the normal distribution comes up although the degenerate limit equation 
does not give any indication for asymptotic normality, and obtain general 
theorems which lead on the basis of information on mean and variance of Y n 
to a central limit law including a rate of convergence. Special cases of our 
setting are suitable to rederive and extend various limit laws from the field 
of analysis of algorithms including rates of convergence. 

First of all, note that if for Y n given in (2) we have that a\ = Var(Y„) ~ 
L(n) for n — > oo, with a function L being slowly varying at oo, we obtain 



a n y L(n) 

almost surely, if I n satisfies mild conditions [see (9)] typically satisfied for 
applications from the analysis of algorithms. If, furthermore, b n is appropri- 
ately small and M n ) = (b n — fi n + fj,i n ) — > almost surely, then we are led 
to the degenerate limit equation for the normalized sequence (X n ). There- 
fore, degenerate limit equations can be expected for quite general types of 
recursions. 

As an example for the degenerate case, consider the cost Y n of an unsuc- 
cessful search in a random binary search tree as discussed in Cramer and Riischendorf 
(1996) and Mahmoud (1992). Here, (Y n ) satisfies (2) with I n ~ unif{l, . . . , ra- 
il, b n = 1, no = 2 and Yq = Y\ = 0. The toll b n = 1 is small compared to the 
similar case of Quickselect considered above. In this case the expectation 
and variance satisfy EY n = 2 Inn + 0(1) and Var(5^) = 21nn + O(l). So the 
scaling now yields A = 1 and 6 = and thus leads to the degenerate limit 
equation. We come back to this example in Section 4. 

Since the case where the variance is a slowly varying function L(n) of the 
order (lnra) a with some a > (up to multiplicative constants) is common 
in the field of analysis of algorithms, we will restrict our setup to this case; 
for examples see Sections 4 and 5. 

The paper is organized as follows: Section 2 contains the basic central limit 
law, Theorem 2.1. In Section 3 tools are developed to handle degenerate limit 
equations leading to a proof of Theorem 2.1. In Section 4 as application a 
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couple of limit laws from the field of analysis of algorithms are rederived in a 
uniform setup. These were previously proven one by one. In the last section 
we extend our results to obtain central limit theorems for the more complex 
recurrences of the the divide-and-conquer type in (1). In particular, our limit 
law covers some more complicated problems related to a maximum-finding 
algorithm in a broadcast communication model as analyzed in Chen and 
Hwang (2003). 

2. A central limit law. Let (l^) n >o be a sequence of random variables 
satisfying the recursion 



(8) Y n ^Y In +b n , n>n , 

where no > 1, (I n ,b n ), (Yk) are independent, b n is random and I n a random 
index in {0, ... , n} with P(/ n = n) < 1 for n > no- We denote a n = yJ\ai(Y n ) 
and jjL n = EY n and use the convention In" n := (lnn) a for a > and n > 1. 
1 1 -X" ||p denotes the L p -norm of a random variable X . Then we have the fol- 
lowing central limit law, where M(0, 1) denotes the standard normal distri- 
bution. 

Theorem 2.1. Assume that (Y n ) n >o satisfies the recursion (8) with 
\\Y n \\3 < oo for all n > and 



'In V 1 

(9) limsupElnl — ) < 0, sup 



n— >oo V n J n >\ 



In 



J n Vl 



< oo. 

3 



Furthermore, assume that for real numbers a, A, k with < A < 2a, the mean 
and the variance of Y n satisfy 

(10) \\b n -fi n + fi In \\ 3 = 0(]n K n), a 2 n = Cln 2a n + 0(ln x n), 
with some constant C > 0. If 

(11) := | A 3(a - «) A 3(q - A/2) A (a - k + 1) > 1, 
then 

(12) Yn ~ EYn ^ AT (0,1) 

and we have the following rate of convergence for the Zolotarev-metric £3: 



G 
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The Zolotarev metric £3 is defined for distributions C(V), C(W) by 
( 3 (jC(V),£(W)):= sup \Ef(V)-Ef(W)\, 

where J= 3 := {/ e C 2 (R,R) : \f"{x) - f"(y)\ < \x - y\} is the space of all 
twice differentiate functions with second derivative being Lipschitz contin- 
uous with Lipschitz constant 1. We will use the short notation Cz(y,W) := 
Q 3 {C{V), C{W)). It is well known that convergence in £3 implies weak conver- 
gence and that ( 3 (V, W) < 00 if EV = EW, EV 2 = EW 2 , and \\V\\ 3 , \\W\\ 3 < 
00. The metric £3 is (3, +) ideal, that is, we have for T independent of (V, W) 
and c 7^ 

(14) ( 3 (V + T,W + T)<( 3 (V,W), UcV 1 cW) = \c\ i UV,W). 

For general reference and properties of £3 we refer to Zolotarev (1976, 1977) 
and Rachev (1991). For implications and interpretation of rates of conver- 
gence in the £3 metric see Neininger and Riischendorf (2002). 

3. Proof of the limit law. For the scaling of the Y n we have Var(l^) ~ 
Cln 2a n with some a > 0. Since the scaling of the recurrence requires a 
scaling for n = 0, 1 as well, we define for integers n > and real 5 > 0, 

L 5 (n) :=ln(nVl) + <5l {0jl }(n), 

where If denotes the indicator function of a set F. We use the convention 
L a s (n) :=(L 5 (n)) a for a > 0. 

To prepare for the proof of Theorem 2.1 we provide two calculus lemmas: 

Lemma 3.1. Let I n be a random variable in {0, . . . ,n} with P(/ n = n) < 
1 for all n sufficiently large and with limsup^^oo Eln((I n V l)/n) < — e for 
some e > 0. Let (d n ) n >o, (r n ) n > no be sequences of nonnegative numbers with 

'(L s {L n )\^ 



d„<E 



V L s (n) 



di„ 



+ r n , n>n >2, 



for some 7 > 0. Then for all 1 < j3 < 1 + 7 and 5 > sufficiently small, we 
have 

r n = o(-^-\ =^ d n = o( 



ln^n/ Vln' 3 l n 

Proof. We abbreviate rj : = 7 + 1 — (3 and choose 5 = e(r] A 1)/ (677) . There 
exists an n\ > uq and an M > with Eln((/ n V l)/n) < — e, p n := P(/ n = 
n) < 1, r n < A// h/n, and (1 + 5/liin) v < 1 + 2i]5/\nn for all n>n 1 . We 
define 

2M a 1 

R:=— -Vmax{4K l (k):0<k<n 1 \ 
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and prove d n < i?/L^~ 1 (n) by induction. For < n < m, there is, by def- 
inition of R, nothing to prove. For n > n±, we obtain, using the induction 
hypothesis, 



d n < Pnd n + E 

This implies 



(15) d n < 
(16) 



1 



/ Ls(In) 

{I ^ n - 1} \L 5 {n) 

(L s {in)y 



R 



+ 



M 
In' 3 n 



< 



(l-p^ln^n 
1 

(l-p^W 3 - 1 



PJ E 



'U*(n) 



Pn + 



M 
Inn 



R E 1 + 



ln((J n Vl)/n) + gy 
Inn 



M 
Inn 



For the estimate of the latter expectation we abbreviate Z := ln((I„ V l)/n) 
and the set F := {Z > —6}. Then we have, using (1 — x) a < 1 — ax for x > 0, 
0<a< 1, 



E 1 + 



z + 5 \i 

Inn 



<E 
< E 



If 
If 



1 + 
1 + 



5 ^ 



Inn 
2nS\ 
Inn J 



+ l F o[l + 



I +lfc( 1 + 



z + jy A1 

Inn / 
{nM){Z + 5) 
Inn 



<1 | 2r?J | (r ? Al)(EZ + <5) 
~~ In n In n 

With EZ < -e and noting that 5 < e(n A l)/(2(2n + (77 A 1))), we obtain the 
estimate 



E 1 + 



Z + 5V 



Inn 



< 1 



Plugging this into (15), we obtain 

1 



d n < 



(l-p^ln^n 
R 



R[l-Pn 



(77 A l)er 
2 Inn 



(1 Ar/)e 
2 Inn 



+ 



M 
Inn 



< 



In' 3 l n (1 — p n )ln /3 n 



(Re(r]Al)/2-M) 



by definition of i?. □ 

Lemma 3.2. For a// a > and integers n > 3 and 1 < i < n, we /tawe 



/Ini N 




2 Va 






) 






\lnn, 




Inn 





8 R. NEININGER AND L. RUSCHENDORF 

Proof. For i = 1, the assertion is true. For i > 2 and a > 1, we have, 
by the mean value theorem, for appropriate s £ [In 2, Inn], 



I ln a i — ln Q n\ = — z — as a 1 1 In z — lnrel < 



ln( - 

n 



ln a n ln a n In n 

We have 

1 u a- , a 1 |ln 2o i-ln 2a n| 1 2q . - 

— — ln Q z - ln a n = — — i -— — 1 < — = ln za i - hr Q n . 

ln°n 1 ln a n ln°i + ln Q n _ ln 2a n 

Thus, for < a < 1, doubling of the exponent a successively yields 
I ln a i — ln Q n\ < ; — |ln a i — ln a n\ 



ln a n' '~ln a n 
with a' £ [1)2). Then applying the first part implies the assertion. □ 

Proof of Theorem 2.1. We have Eln((/ n V l)/n) < -e for all n > 
n\ > uq and some e > 0. We define the scaled quantities 

Y n - KY n 

Z„ := —= , n > 0, 

VCL a s (n)' 



with a 5 > sufficiently small to be specified later and denote r n := \/Vav(Z n ) 
&nl iyC Li (n)). Thus, we have r n — > 1 for n — > 00. The sequence (Z n ) satis- 
fies the recurrence 

*=(^)" z '- +tW ' 

with 

{,(«) = 6 W(j nj 6 n ) = 1 (b w - /i n + MJ J. 
vCL 5 (n) 

Now we define N n := r n N, where iV is a standard normal distributed random 
variable independent of (I n ,b n ), and introduce an accompanying sequence 
(K) by 



V L s (n] 



-)V + & (n) , 



Note that Z n , N n , Z* have identical first and second moment, and finite 
absolute third moment. Thus, £3 distances between these random variables 
are finite. We have 



(17) 



UZn,N n ) < uz m zi) + ( 3 (Z* n , N n ). 
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Using that £3 is (3,+) ideal, compare (14), and conditioning on (I n ,b n ), we 
obtain 



( 3 (Z n ,Z*) = sup 



E 



/ 



cflP( 7 "> 6 ")(A,s) 



(18) 



< 



fc=0 

We will show below that 



C<(Mf)"W> ( M>, 
< £ p(/ n = fc) df^y) 3Q C 3 (^, JV fc ). 



(19) 



C 3 (^,iVn)=0 



In' 3 



with /3 given in (11). With this estimate, we obtain from (17) and (18) 
denoting d n := C 3 (Z n ,iV n ) and r n = ( 3 (Z*,N n ), 



d n = ( 3 (Z n ,N n ) 
n , 



k=0 

E 



L S (k) 
L s {n) 



3n 



dk) +r n 



( Ls(I n 



3a 



di„ 



+ r n . 



A L s (n) 

Lemma 3.1 then implies d n = ^ 3 (Z n , N n ) = 0(l/ln^ _1 n) since, by definition 
of /?, we have (3 < 1 + 3a and 5 can be chosen appropriately. Moreover, we 
obtain 



fin 



Af(0,l)) =( 3 [-Z n ,-N n 



1 



1 



1 



o 



(3(Z n ,N n ) 



1 



which is (13). Since £3 convergence implies weak convergence, we obtain 
(12). 

It remains to establish the bound (19) for ( 3 (Z*,N n ): We define 

fL 5 (i n y 



V L s (n) 
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thus, we have the representation Z* = G n N + b^ n \ From Var(Z*) = t 2 , we 
obtain, in particular, the relation 

(20) E[G 2 n + (b^) 2 ]-r 2 = 0. 

Using the closure of the normal familiy under convolution, we have, with 
the set A := {G n > r n } and its complement A c , the decompositions 

(21) Z* k \ A (j n N + VG 2 n - r 2 N' + 6W) + l A c(G n iV + &(»)), 

(22) N n k l A ( Tn N) + l A c(G n N+ Vr 2 - G 2 n N'), 

where N,N',G n are independent, N = N' . Subsequently, we abbreviate 
A n := \Gn - t 2 ] 1 / 2 and the right-hand sides in (21) and (22) by Z* and 
N n , respectively. 

We have to estimate |E[/(Z*) — f(N n )]\ uniformly for / G JT 3 . Taylor 
expansion around N yields f(x) = f(N) + f'(N)(x - N) + (l/2)/"(JV)(x - 
N) 2 + R(x,N) for x 6 R. Here we have |-R(a;,iV)| < (l/6)|x-iV| 3 since /" has 
Lipschitz constant 1. We may subsequently assume that f"(0) = 0. If f"(0) ^ 
0, consider g(x) := f(x) - (f"(Q)/2)x 2 . Then we have g"(0) = and, since 
Z*, N n have identical second moment, E[/(Z*) - f(N n )] = E[g(Z*) - g(N n )}. 

Using the Taylor expansion and representations (21) and (22), we have 

E[f(Z*) - f(N n )} = E[5i + S 2 + R(Z* n , N) - R(N n , N)}, 

where, for S\, we collect the terms involving the factor f'(N) and, for S 2 , 
we collect the terms involving the factor f"(N). Hence, after simplification 
and using that N, N' , and (G n ,b^) are independent, we obtain 

5 1 = f(N)(A n (l A - l Ae )N' + 6^), 

5 2 = ^(A 2 n (l A -l Ac )(N') 2 + (b^) 2 

+ 2b^N(l A ( Tn - 1) + l Ac (G n - 1))). 

Since EN' = E&( n ) = and by the independence between N and b^ and be- 
tween N' and (N, G n ), we obtain ESi = 0. For the estimate of ES 2 , first note 
that we have A„(1 A — 1a c ) = G^ — r„. Hence, with (20), the independence 
of N,N', (G n ,b^), and E(N') 2 = 1, we obtain 

E f^l (A 2 (U _ i Ac )(AT') 2 + (6 {n) ) 2 ) = 0. 

Furthermore, note that for f £ J 7 ^ with f"(0) = 0, we have 
|E[/"(JV)JV]| = |E[(r(JV)-r(0))JV]| 

<E[|/"(iV)-/"(0)||iV|] <EiV 2 = l. 
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Thus, with the independence of N to the other quantities, we obtain the 
bound 

|ES 2 | < E\b^(\ Tn - 1| + \G n - 1|)| < ||6 (n) || 2 (|r n - 1| + \\G n - 1|| 2 ). 

For the remainder terms we have the O-estimate 

E\R(Z*,N)\ < ±E[| (r n - 1)N + A n N' + b^f + \(G n - 1)N + 6 (n) | 3 ] 

= 0(\r n - 1| 3 + ||A„||1 + \\bM\\l + \\G n - 

The term K\R(N n , N)\ is bounded by the same O-term. Hence, altogether 
we obtain 



(23) 



C 3 (^,iV re ) = 0(\r n - 1| 3 + || A re ||! + ll^lls + \\G n - 1||1 

+ ||6(")|| 2 (|r n -l| + ||G n -l|| 2 )). 



For the estimate of the latter norms and distances note that, using Lemma 
3.2, we have 

\G n -l\= * \a In -VCln a n\ 
yC In n 



< 



1 



C\n 2a n 
1 

Cln 2a n 
In VI 



\aj -Cln 2Q n| 



In 



C ln 2a ( I n V 1) - C ln 2a n + 0(ln A n) 

o- 1 



ln lA(2a-X) n J- 



Analogously, we obtain \r n — 1| = 0(l/ln 2a A ra). 
With 

su Pn>i II l n ((^n V 1 ) /tt.) 1 1 3 < 00, we obtain 



\G n -lh = 



1 



ln lA(2a-A) 



n 



By definition of and (10), we have ||&( n ) || 3 = 0(1/ ln a " K n). For A n , we 
obtain 



\A 



n || 3 



J2 _ n 2 I II - ll-r 2 -^ 2 !! 1 / 2 

'n "nl 113 II n "nil 3/2 



< Ik 2 r 2 n 1/2 

^ ll T n ~~ "nib 



<(k 2 -l| + ||G 2 -ly 1 / 2 



o 



j n (l/2)A(a-A/2) n 



12 



R. NEININGER AND L. RUSCHENDORF 



Collecting the estimates, we bound the right-hand side in (23). Estimating 
there the L2-norms, by L3-norms, we finally obtain 

with 

(3 = | A 3(a - k) A 3(a - A/2) A (a - k + 1) A (3a - k - A). 

Note that this coincides with the representation for (5 in (11) since we have 
3(a — k) A 3(q — A/2) < 3a — k — A. This is seen by distinguishing the cases 
k> A/2 and k< A/2. □ 

In the proof of Theorem 2.1 the limit normal distribution is no longer 
obtained from the limit fixed-point equation as in the usual contraction 
method. Instead, as a substitute, the closure of the normal family under 
convolution used in (21) and (22) allows us to mimic the recurrence satisfied 
by (Z n ), respectively, by the accompanying sequence (Z*) in terms of normal 
quantities. This decomposition allows for estimating ^(Z*,N n ) sufficiently 
tight. It is easy to see that the scaling property in (21) and (22) essentially 
characterizes the normal distribution. More precisely, the following lemma 
explains the occurrence of the normal limit distribution: 

Lemma 3.3 (Characterization of normal distributions). LetX,W be in- 
dependent with mean and variance 1 and assume that for all q £ (0, 1), 

(24) X = qX + V 7 ! - q 2 W. 

Then we have X = J\f(0, 1) . 

Proof. From (24) we obtain for all fixed n > 1, by induction on 1 < 
k < n, that 

where Wi, . . . , W n ,X are independent with Wj = W for all j = 1, . . . , n. 
Thus, with k = n we have 

Vn + l Vn+l^Vn^ J J 
Therefore, the central limit theorem implies X =Jv (0, 1). □ 

Note that a similar scaling property valid for stable distributions, in prin- 
ciple, allows the method of proof of Theorem 2.1 to a stable limit theorem. 
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4. Applications and discussion. In this section we give applications of 
Theorem 2.1. A couple of limit laws obtained before by different means and 
involving specific calculations for each case are covered by Theorem 2.1: 

Unsuccessful search. The cost of an unsuccessful search in a random 
binary search tree with n nodes, as discussed in Cramer and Riischendorf 
(1996) and Mahmoud (1992), satisfies recurrence (1) with I n ~ unif{l, . . . , n — 
1}, b n = 1 for n > 2, and Y = Y\ = 0. We have [see Mahmoud (1992)] 

EY n = 21nn + 0(l), Var(Y n ) = 21nra + O(l) 

and obtain in the notation of Theorem 2.1, 

\K -Hn + IH n \\ 3 = pln(Vn) + 0(l)|| 3 = 0(1). 

Thus, the parameters in Theorem 2.1 are a = 1/2, k = A = and we have 
(3 = 3/2. The technical conditions in (9) are satisfied since ln((/ n V 2)/n) — > 
InU in L3 for a unif [0, 1] random variable U. (Use representations I n = 
\(n — 1)U~\ and decompose the domain of the resulting integral into the 
intervals (i/n, (i + l)/n] for i = 0, . . . ,n— 1.) Theorem 2.1 implies the central 
limit law with a rate of convergence: 

Note that the l/y/\nn rate of convergence for different metrics was shown 
previously in Cramer and Riischendorf (1996) based on calculations involv- 
ing the particular distribution of /„. 

Depths of nodes. The depth of a random node in a random binary search 
tree with n nodes satisfies recurrence (1) with P(/ n = 0) = 1/n and P(J n = 
k) = 2k/n 2 for 1 < k < n — 1 and b n = 1, where n>2 and Yq = —1, Y\ = 0. 
We have [see Mahmoud (1992)] 

EY n = 2 Inn + O(l), Var(y n ) = 2 Inn + O(l), 

and obtain in the notation of Theorem 2.1, 

\K -Vn + = Pln(/ n /n) + 0(1)|| 3 = 0(1). 

Hence, the parameters of Theorem 2.1 are given by a = 1/2, k = A = 
and we obtain (3 = 3/2. The technical conditions in (9) are satisfied since 
ln((I n V2)/n) — ► lnvL^ in L3 for a unif[0, 1] random variable U and Theorem 
2.1 implies the central limit law with a rate of convergence as in (25). 

Mahmoud and Neininger (2003) obtained this rate of convergence via an 
explicit calculation based on the specific distribution of I n and showed 
the optimality of the order l/Vhm, that is, (3((Y n — u n )/a n , Af(0, 1)) = 



14 



R. NEININGER AND L. RUSCHENDORF 



0(1/Vlnn ). This indicates that our estimates in the proof of Theorem 
2.1 are tight. See also Mahmoud and Neininger (2003) for a different dis- 
tributional recurrence satisfied by (Y n ) which leads to the limit equation 

X = BX +(1- B)X', where X,X',B are independent with X,X' being 
identically distributed and B Bernoulli(l/2) distributed. This limit equa- 
tion similar to X 
degenerate type. 



tion similar to X = X is as well satisfied by any distribution, hence, also of 



Broadcast communication. The time (Y n ) of a maximum-finding algo- 
rithm for a broadcast communication model with n processors as analyzed 
in Chen and Hwang [(2003), Algorithm B] satisfies Yq = Y\ = 1 and, for 
n > 2, recurrence (1) with I n ~ unif{0, . . . ,n — 1} and b n being the time 
(= number of rounds) used by a leader election algorithm as discussed in 
Prodinger (1993) and further analyzed in Fill, Mahmoud and Szpankowski 
(1996). We have [see Chen and Hwang (2003)] Ebl = 0(\n 3 n) and 

KY n = n In 2 n + O(lnn), Var(Y n ) = a 2 In 3 n + 0(ln 2 n), 

with positive constants a. A direct calculation gives, after cancellations 
of leading terms, 

\\K - Mn + AiiJl3 = O(fnn). 

Thus, we have a = 3/2, k = 1 and A = 2, which gives (3 = 3/2. This implies 
the following 

Corollary 4.1. The time (Y n ) of Algorithm B in Chen and Hwang 
(2003), as introduced above, satisfies 



Vv/Var(Y n ) 

The same bound for the rate for the Kolmogorov metric was obtained in 
Chen and Hwang (2003). 

5. Extensions and applications. We consider now the more general re- 
currence for (Y n ), as in (1), 

K 

(26) ^ = E^2+ 6 ». n > n o, 

r=l 



where no, K > 1, b n is a random variable, I± , . . . , 1^ S {0, . . . , n} are ran- 
dom indices, and (Y^), . . . , (K, ) distributional copies of (Y^) such that 
(Y^), . . . , ), (ii , • • • , Ik-i bn) are independent. Many examples of 
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divide- and- conquer type algorithms lead to this equation and have been 
considered in the analysis of algorithms literature. 

We introduce the scaling X n := (Y n — fJ, n )/o~ n , where [i n = KY n and a n = 
^/Var(y n ) and obtain as recurrence relation for X n , 



(27) 
where 



X n ±± a -^X% +b M, n> 

r=l ° n 



no, 



&<»>:=- 



V r=l 



and . . . , (if), (/l n) , . . . , lP,b n ) are independent, . . . , (if) 

being distributional copies of (X n ). 

Extensions of Theorem 2.1 in various directions are possible. We give as 
an example a theorem tailored for the case when the coefficients a ( n ) /cr n in 

(27) behave roughly as follows: 



cr („) 

On 



Ai = l, 



A. = 0, 



0"» 

A" to/ 



r = 2,...,K. 



We assume that limsup,^^ X)r=i ^(-^ 
and jU„ = EY n . 



77 



< 1 and denote <7 n = ^/Var(Y n ) 



Theorem 5.1. Assume that (Y n )n>o satisfies the recurrence (26) wrai/i 
|y n ||3 < oo for all n > 0, and 









J <o, 


sup 




n>l 





< OO. 



Furthermore, assume that for real numbers a, A, k with < A < 2a, the mean 
and the variance of Y n satisfy 

K 

= 0{\n K n), a 2 n = C\n 2a n + 0{\u x n) 

3 

with some constant C > and that for some real number £ > ; we have 
||ln a (4 ra ) Vl)|| 3 = 0(ln*n), r = 2,...,K. 

If 

(29) p := | A 3(a - k) A 3(a — £) A 3(a - A/2) A (a - k + 1) > 1, 



r=l 



then 
(30) 



C7 1n Q n 



AT(0,1), 
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and we have the following rate of convergence for the Zolotarev-metric £3: 



Cs 



Y — W.Y 



v/Var(y n ) 



o 



1 



ln /3-l 



For the proof we need a substitute for Lemma 3.1: 



Lemma 5.2. Let l[ n \...,I^ be random variables in {0, ...,n} with 
Y^=i ^{Ir^ = n) < 1 /or a// n sufficiently large, and 

K 



limsupElnf- TT(4 n) V I)] < -e 

\ n r=l J 



for some e > 0. Let (d n ) n >o, (r n ) n >„ be sequences of nonnegative numbers 
with 



d n <E 



K 



r=l v 



L 5 (n) 



oL(n) 



n > no, 



/or some 7 > 1. Then, for all 1 < /3 < 7 and 5 > sufficiently small, we have 



O 



1 



ln^ 3 n 



1 



111 



/3-1 



The proof of Lemma 5.2 follows the argument of the proof of Lemma 
3.1. Note that we have the more restrictive condition 1 < (3 < 7 compared 
to l</5<7+l in Lemma 3.1. This allows for replacing the analog of the 
estimates (15) and (16) in the proof of Lemma 3.1 by 

;E ^ | ln((/{ w) Vl)/n) y | * / ln(Z< w) Vl) y 

Inn / — ^ \ 111 77. ' 



K /ln(/ r W Vi; 



r=l v 



Inn 



<E 



Inn 

r(«) 



r=2 



^((/rviVn) + * ln(/^ V 1) 



Inn 



r=2 



Inn 



<i + -^ElnfiTT(4 n) vi)V 

in n \ n y 



For this we used that 77 = 7 + 1 — /3>1. 

We sketch the extension of the techniques for Theorem 2.1 to obtain 
Theorem 5.1. 

Proof of Theorem 5.1. (Sketch) We have Eln((l/n) U^Lii 1 ^ V 
1)) < — e for all n>n\ with an appropriate e > and n\ > uq. With 5 > 
sufficiently small, we define the scaled quantities 

Y n - KY n 



cL a 5 ( n y 



n>0, 
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and denote r n := y/\ax(Z n ) = a n / (yCLf(n)); thus, r n —* 1 for n — > oo. We 
have the recurrence 

with 

6W = 77^7 ^ ( bw " ^ n + £ ^ (n) J • 
VCL s (n)\ r=1 >r J 

We define := r n N^ , where JVW, . . . , AfW are standard normal dis- 
tributed random variables such that b n ),N^ 1 ' , • • • , are independent. 
Also, we introduce an accompanying sequence (Z*) by 



_^/L5(^)\ a (r) (n) 



m. 



Then, with d n = C,z{Z n ,N n ) and r n = £z(Z*,Nn), we obtain similarly to 
the argument in the proof of Theorem 2.1, 



dL <E 



For the estimate of r n = (s(Z*,N n ), we define 



and use the representations 



A:={G n >T n }, A n :=V\G 2 n -T%\ 



z* n =i A \r, 



iV n £ l A (r n JVW) + l A o{G n N^ + A n N'), 

where N' is standard normal distributed and independent of the other ran- 
dom variates. With corresponding estimates, as in the proof of Theorem 1.1, 
we find 



C3(Z* n ,N n ) = o[\r n - 1| 3 + ||A n ||| + + \\G n - 1| 
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K 



+ ||&W|| 2 (|r„ - 1| + \\G n - 1|| 2 ) + -I- EH lnQ (4 n) V 1) 

m n ^ 



1 



, In' 3 n / ' 

with (3 given in (29). Since (3 < 3a, Lemma 5.2 completes the proof. □ 

As applications of Theorem 5.1 we discuss various cost measures (Y n ) for 
a maximum finding algorithm in a broadcast communication model with n 
processors as analyzed in Chen and Hwang [(2003), Algorithm A]. We use 
their expansions for mean and variance and, by Theorem 5.1, rederive the 
central limit laws. Additionally, we endow them with new rates of conver- 
gences. Several cost measures (Y n ) of this algorithm satisfy the recurrence 

(31) Yn = ygj, + Yf n) +b n , n> 2, 

with relations as in (26), where b n varies for different cost measures, whereas 
the distribution of the indices (l[ n ^ , 1% ) is in all cases given by 

nUl ,h )-{J^))-<y { n-k-^ 2 -n^ k>0,l<j<n-k. 

In particular, we have that the marginal is binomial Bin, 1/2) dis- 
tributed and 

r(-)_n_/5 + 2 " n . k = U> 

< k+1 \ Kk<n-1. 



P(4 n) = *) = { = 



The technical conditions in Theorem 5.1 regarding the indices (i^ , I2 ) are > 
hence, satisfied: We have P(l| n) =n)+ F(I^ n) = n) = 2~ n < 1 for all n > 1 
and ||ln((l{ n) V l)/n)|| 3 -» ln2 since is binomial B(n, 1/2) distributed, 
thus, we have sup n>1 ||ln((/-( n ^ V l)/ra)||3 < 00. For the verification of the 
first condition in (28) note that we have Kln((l[ n ^ V l)/n) — ► — In 2 and, 
therefore, it is sufficient to show limsupn^^ Eln(4 n) V 1) < ln2. We have 

„ . („) , 2^ Ink ln2 k ln2 1 

Eln(4 n) Vl) = 5:^TT< — + Ep = J + 2<0.6<ln2. 

Chen and Hwang (2003) analyze three cost measures, namely, the time 
(= number of rounds) taken by the algorithm, the number of coin flips 
performed and the number of comparisons performed. The number of coin 

flips does not lead to a degenerate limit equation X = X and can be treated 
by standard application of the contraction method, see, for example, Rosier 
(2001). We focus on the other two more delicate parameters: 
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Time of the algorithm. The time (Y n ) of the maximum finding Algorithm 
A analyzed in Chen and Hwang (2003) satisfies (31) with b n = 1 and Yq = 
Y\ = 1. Mean and variance satisfy, see Chen and Hwang (2003), 

EY„ = /Unn + 0(1), Var(Y n ) = a 1 Inn + 0(1), 

with constants /2, & > being explicitly known. Hence, in the notation of 
Theorem 5.1 we have 

\\b n -Hn + + MjW ll 3 = O(l), W^' 2 ^ V l)|| 3 = O(l), 

using that also || ln^" V 1 ) 1 1 3 = O(l). Thus, we have a = 1/2 and k = A = 
£ = 0, which gives (3 = 3/2. With Theorem 5.1 we rederive the central limit 
law and add the following rate of convergence: 

Corollary 5.3. The time (= number of rounds) (Y n ) of the maxi- 
mum finding Algorithm A in Chen and Hwang (2003), as introduced above, 
satisfies 

Number of comparisons. The number of comparisons (Y n ) of the maxi- 
mum finding Algorithm A was analyzed in Chen and Hwang (2003). It sat- 
isfies (31) with b n = n — l[ n ^ and Yq = Y\ = 0. Mean and variance have the 
expansions, see Chen and Hwang (2003), 

EY n = n + /2hm + 0(1), Var(Y n ) = a 2 Inn + 0(1), 

with constants p,, a > being explicitly known. Hence, in the notation of 
Theorem 5.1 we obtain, after cancelation, 

\\b n ~Vn + M j( n) + hm> || 3 = ||Aln((4 n) V l)/n) + 4 n) + uln4 n) + 0(1)|| 3 

= 0(1). 

Thus, we have a = 1/2 and k = A = £ = 0, which gives j3 = 3/2. Theorem 5.1 
rederives the central limit law and adds a rate of convergence: 

Corollary 5.4. The number of comparisons (Y n ) of the maximum 
finding Algorithm A in Chen and Hwang (2003), as introduced above, sat- 
isfies 
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